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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANTS: Petkovich, P. Martin, White, Jay A. , 

Beckett, Barbara R-, Jones, Glenville 

(ii) TITLE OF INVENTION: Retinoid Metabolizing Protein 
(iii) NUMBER OF SEQUENCES: 43 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Torys LLP 

tB) STREET: 3000 - 79 Wellington Street West 

CO CITY; Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: M5K 1N2 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3 1/2 inch, 1.4 Mb storage 

(B) COMPUTER: COMPAQ, IBM PC compatible 

(C) OPERATING SYSTEM: MS-DOS 5.1 

(D) SOFTWARE: WORD PERFECT 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/668,482 

(B) FILING DATE J September 25, 2000 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBERS: 08/667,54 6; 03/724,4 66; PCT/CA97/004 40; 

(B) FILING DATE: June 21, 1996; October 1, 1996; June 23, 1997; 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Hunt, John C, 

(B) REGISTRATION NUMBER: 36,424 

(C) REFERENCE /DOCKET NUMBER: 32391-2005 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE; (416) 865-8121 

(B) TELEFAX: (416) 865-7390 



(2) INFORMATION FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

TGCCAGTGGA CAATCTCCCT ACCAAATTCA CTAGTTATGT CCAGAAATTA GCCTAAACCG 60 

GAGCCTTTGT ACATATGTTT TTATTTTAGA TGAACTGTGA TGTATTGGAT ATTTTCTAAT 120 

TTGTTTATAT AAAGCAGATG TGTATATAAG TCTATGCGAA GAAGCGAAAA CGAGGGCACT 180 
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ACTTTCTCAT GGATCACTGT AATGCTACAG AGTGTCTGTG ATGTATATTT ATAATGTAGT 240 
TGTGTCATAT AGCTTTTGTA CTGTATGCAA CTTATTTAAC TCGCTCTTTA TCTCATGGGT 300 
TTTATTTAAT AAAACATGTT CTTACAAAAA AAAAAAA 337 



(2) INFORMATION FOR SEQ ID NO; 2 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 4 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

Met Gly Leu Tyr Thr Leu Met Val Thx Phe Leu Cys Thr lie Val Leu 
1 5 10 15 

Pro val Leu Leu Phe Leu Ala Ala Val Lys Leu Trp Glu Met Leu Met 

20 25 30 

lie Arg Arg Val Asp Pro Asn Cys Arg Ser Pro Leu Pro Pro Gly Thr 
35 40 45 

Met Gly Leu Pro Phe lie Gly Glu Thr Leu Gin Leu lie Leu Gin Arg 
50 55 60 

Arg Lys phe Leu Arg Met Lys Arg Gin Lys Tyr Gly Cys He Tyr Lys 
65 70 75 80 

Thr His Leu Phe Gly Asn Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

Val Arg Gin He Leu Leu Gly Glu His Lys Leu Val Ser Val Gin Trp 

100 105 HO 

Pro Ala Ser Val Arg Thr He Leu Gly Ser Asp Thr Leu Ser Asn Val 
115 120 125 

His Gly Val Gin His Lys Asn Lys Lys Lys Ala He Met Arg Ala Phe 
130 135 140 

Ser Arg Asp Ala Leu Glu His Tyr He Pro Val He Gin Gin Glu Val 
145 150 155 160 

Lys Ser Ala He Gin Glu Trp Leu Gin Lys Asp Ser Cys Val Leu Val 

165 170 175 

Tyr Pro Glu Met Lys Lys Leu Met Phe Arg He Ala Met Arg lie Leu 

180 IBS 190 

Leu Gly Phe Glu Pro Glu Gin He Lys Thr Asp Glu Gin Glu Leu Val 
195 200 205 
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Glu Ala Phe Glu Glu Met He Lys Asn Leu Phe Ser Leo Pro He Asp 
210 215 220 

Val Pro Phe Ser Gly Leu Tyr Arg Gly Leu Arg Ala Arg Asn Phe He 
225 230 235 240 

His Ser Lys lie Glu Glu Asn He Arg Lys Lys He Gin Asp Asp Asp 

245 250 255 

Asn Glu Asn Glu Gin Lys Tyr Lys Asp Ala Leu Gin Leu Leu He Glu 

260 265 210 

Asn Ser Arg Arg Ser Asp Glu Pro Phe Ser Leu Gin Ala Met Lys Glu 
275 280 285 

Ala Ala Thr Glu Leu Leu Phe Gly Gly His Glu Thr Thr Ala Ser Thr 
290 295 300 

Ala Thr Ser Leu Val Met Phe Leu Gly Leu Asn Thr Glu Val Val Gin 
305 310 315 320 

Lys Val Arg Glu Glu Val Gin Glu Lys Val Glu Met Gly Met Tyr Thr 

325 330 335 

Pro Gly Lys Gly Leu Ser Met Glu Leu Leu Asp Gin Leu Lys Tyr Thr 

340 345 350 

Gly Cys Val He Lys Glu Thr Leu Arg He Asn Pro Pro Val Pro Gly 
355 360 365 

Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn Gly Tyr Gin lie 
370 375 380 

Pro Lys Gly Trp Asn Val He Tyr Ser He Cys Asp Thr His Asp Val 
385 390 395 400 

Ala Asp Val Phe Pro Asn Lys Glu Glu Phe Gin Pro Glu Arg Phe Met 

405 410 415 

Ser Lys Gly Leu Glu Asp Gly Ser Arg phe Asn Tyr He Pro Phe Gly 

420 425 430 

Gly Gly Ser Arg Met Cys val Gly Lys Glu Phe Ala Lys Val Leu Leu 
435 440 445 

Lys He Phe Leu Val Glu Leu Thr Gin His Cys Asn Trp lie Leu Ser 
450 455 460 

Asn Gly Pro Pro Thr Met Lys Thr Gly Pro Thr He Tyr Pro Val Asp 
465 470 475 480 

Asn Leu Pro Thr Lys Phe Thr Ser Tyr Val Arg Asn 

485 490 
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(2) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1B50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDS ONES S ; single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

TGTCGCCGTT GCTGTCGGTT GCTGTCGGAC GCTGTCTCCT CTCCAGAAGC TTGTTTTTCG 60 

TTTTGGCGAT CAGTTGCGCG CTTCAAC ATG GGG CTG TAC ACC CTT ATG GTC ACC 114 

Met Gly Leu Tyr Thr Leu Met Val Thr 
1 5 

TTT CTC TGC ACC ATC GTG CTA CCC GTT TTA CTC TTT CTC GCC GCG GTG 162 
Phe Leu Cys Thr lie Val Leu Pro Val Leu Leu Phe Leu Ala Ala Val 
10 15 20 25 

AAG TTG TGG GAG ATG TTA ATG ATC CGA CGA GTC GAT CCG AAC TGC AGA 210 
Lys Leu Trp Glu Met Leu Met lie Arg Arg Val Asp Pro Asn Cys Arg 

30 35 40 

AGT CCT CTA CCG CCA GGT ACC ATG GGC TTG CCG TTC ATT GGA GAA ACG 258 
Ser Pro Leu Pro Pro Gly Thr Met Gly Leu Pro Phe lie Gly Glu Thr 

45 50 55 

CTC CAG CTG ATC CTC CAG AGA AGG AAG TTT CTG CGC ATG AAA CGG CAG 306 
Leu Gin Leu lie Leu Gin Arg Arg Lys Phe Leu Arg Met Lys Arg Gin 
60 65 70 

AAA TAC GGG TGC ATC TAC AAG ACG CAC CTC TTC GGG AAC CCG ACT GTC 354 
Lys Tyr Gly Cys lie Tyr Lys Thr His Leu Phe Gly Asn Pro Thr Val 
75 80 85 

AGG GTG ATG GGA GCT GAT AAT GTG AGG CAG ATT CTG CTG GGC GAA CAC 402 
Arg Val Met Gly Ala Asp Asn Val Arg Gin lie Leu Leu Gly Glu His 
90 95 100 105 

AAG CTG GTG TCT GTT CAG TGG CCA GCA TCA GTG AGA ACC ATC CTG GGC 4 50 

Lys Leu Val Ser Val Gin Trp Pro Ala Ser Val Arg Thr He Leu Gly 

110 115 120 

TCT GAC ACC CTC TCC AAT GTC CAT GGA GTT CAA CAC- AAA AAC AAG AAA 498 
Ser Asp Thr Leu Ser Asn Val His Gly Val Gin His Lys Asn Lys Lys 

125 130 135 

AAG GCC ATT ATG AGG GCG TTC TCT CGA GAT GCT CTG GAG CAC TAC ATT 54 6 

Lys Ala lie Met Arg Ala Phe Ser Arg Asp Ala Leu Glu His Tyr lie 
140 145 150 

CCC GTG ATC CAG CAG GAG GTG AAG AGC GCC ATA CAG GAA TGG CTG CAA 594 
Pro Val He Gin Gin Glu Val Lys Ser Ala He Gin Glu Trp Leu Gin 
155 160 165 
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Leu Asn Thr Glu Val Val Gin Lys Val Arg Glu Glu Val Gin Glu Lys 
315 320 325 

GTT GAA ATG GGC ATG TAT ACA CCT GGA AAG GGC TTG AGT ATG GAG CTG 1122 
Val Glu Met Gly Met Tyr Thr Pro Gly Lys Gly Leu Ser Met Glu Leu 
330 335 340 345 

TTG GAC CAG CTG AAG TAC ACT GGA TGT GTG ATT AAA GAG ACT CTT AGA 117 0 

Leu Asp Gin Leu Lys Tyr Thr Gly Cys Val lie Lys Glu Thr Leu Arg 

350 355 360 

ATC AAC CCT CCT GTT CCC GGA GGA TTC AGA GTC GCA CTC AAA ACC TTT 1218 
lie Asn Pro Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe 

365 370 375 

GAA TTG AAT GGT TAC CAA ATT CCT AAA GGA TGG AAC GTC ATT TAC AGC 1266 
Glu Leu Asn Gly Tyr Gin lie Pro Lys Gly Trp Asn Val lie Tyr Ser 
380 365 390 



- 5/28 - 

j 

AAA GAC TCC TGC GTG CTG GTT TAT CCA GAA ATG AAG AAA CTC ATG TTT 642 
Lys A3p Ser Cys Val Leu Val Tyr Pro Glu Met Lys Lys Leu Met Phe 
170 175 180 195 

4 
1 

. . . i 

CGQ ATA GCT ATG AGA ATC CTG CTT GGT TTT GAA CCA GAG CAA ATA AAG 690 
Arg lie Ala Met Arg lie Leu Leu Gly Phe Glu Pro Glu Gin lie Lys * 

190 195 200 

ACG GAC GAG CAA GAA CTG GTG GAA GCT TTT GAG GAA ATG ATC AAA AAC 736 : 

Thr Asp Glu Gin Glu Leu Val Glu Ala Phe Glu Glu Met lie Lys Asn ) 

205 210 215 : 

t 

TTG TTC TCC TTG CCA ATC GAC GTT CCT TTC AGT GGT CTG TAC AGG GGT 786 \ 

Leu Phe Ser Leu Pro lie Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly 
220 225 230 

■ 

TTG AGG GCA CGC AAT TTC ATT CAC TCC AAA ATT GAG GAA AAC ATC AGG 834 \ 

Leu Arg Ala Arg Asn Phe lie His Ser Lys lie Glu Glu Asn lie Arg , 
235 240 245 

AAG AAA ATT CAA GAT GAC GAC AAT GAA AAC GAA CAG AAA TAC AAA GAC 882 ' 

Lys Lys lie Gin Asp Asp Asp Asn Glu Asn Glu gin Lys Tyr Lys Asp ; 
250 255 260 265 I 

GCC CTT CAG CTG TTG ATC GAG AAC AGC AGA AGA A3T GAC GAA CCT TTT 930 
Ala Leu Gin Leu Leu He Glu Asn Ser Arg Arg Ser Asp Glu Pro Phe : 

270 275 280 I 

AGT TTG CAG GCG ATG AAA GAA GCA GCT ACA GAG CTT CTA TTT GGA GGT 976 1 

Ser Leu Gin Ala Met Lys Glu Ala Ala Thr Glu Leu Leu Phe Gly Gly i 

285 290 295 j 

CAT GAA ACC ACC GCC AGC ACT GCA ACC TCA CTT GTC ATG TTT CTG GGT 1026 S 

His Glu Thr Thr Ala Ser Thr Ala Thr Ser Leu Val Met Phe Leu Gly ■ 
300 305 310 

1 

1 

CTC AAC ACA GAA GTG GTG CAG AAG GTC AGA GAG GAG GTT CAG GAG AAG 1074 ' 
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ATC TGT GAC ACG CAC GAT GTG GCC GAC GTC TTT CCA AAC AAA GAG GAG 1314 
lie Cys Asp Thr His Asp Val Ala Asp Val Phe Pro Asn Lys Glu Glu 
395 400 405 

TTC CAG CCG GAG AGA TTC ATG AGC AAA GGT CTG GAG GAC GGG TCC AGG 1362 
Phe Gin Pro Glu Arg Phe Met Ser Lya Gly Leu Glu Asp Gly Ser Arg 
410 415 420 425 

TTT AAC TAG ATC CCC TTC GGA GGA GGA TCC AGG ATG TGT GTG GGC AAA 1410 
Phe Asn Tyr He Pro Phe Gly Gly Gly Ser Arg Met Cys Val Gly Lys 

430 435 440 

GAG TTC GCC AAA GTG TTA CTC AAG ATC TTT TTA GTT GAG TTA ACG CAG 14 58 

Glu Phe Ala Lys Val Leu Leu Lys lie Phe Leu Val Glu Leu Thr Gin 

445 450 455 

CAT TGC AAT TGG ATT CTC TCA AAC GGA CCC CCG ACA ATG AAA ACA GGC 1506 
His Cys Asn Trp lie Leu Ser Asn Gly Pro Pro Thr Met Lys Thr Gly 
460 465 470 

CCG ACT ATT TAG CCA GTG GAC AAT CTC CCT ACC AAA TTC ACT AGT TAT 1554 
Pro Thr He Tyr Pro Val Asp Asn Leu Pro Thr Lys Phe Thr Ser Tyr 
475 480 485 

GTC AGA AAT TAGCCTAACC GGAGCTTTGT ACATATGTTT TTATTTTAGA 1603 

Val Arg Asn 

490 

TGAACTGTGA TGTATTGGAT ATTTTCTATT TTGTTTATAT AAAGCAGATG TGTATATAAG 1663 

TCTATGCGAG GAAGCGAAAA CGAGGGCACT ACTTTCTCAT GGATCACTGT AATGCTACAG 1723 

AGTGTCTGTG ATGTATATTT ATAATGTAGT TGTGTTATAT AGCTTTTGTA CTGTATGCAA 1783 

CTTATTTAAC TCGCTCTTTA TCTCATGGGT TTTATTTAAT AAAACATGTT CTTACAAAAA 184 3 



AAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 97 amino acids 

(B) TYPE: amino acid 

£C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



1850 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 4 

Met Gly L«*u Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 

1 5 10' 15 

Pro Leu Leu Leu Phe Leu Ala Ala lie Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 

Val Ser Gly Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 
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Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 

Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe lie Tyr Lys 
65 70 75 80 

Thr His Leu Phe Gly Arg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

B5 90 95 

val Arg Arg lie Leu Leu Gly Asp Asp Arg Leu Val Ser val His Trp 

100 105 HO 

Pro Ala Ser Val Arg Thr He Leu Gly Ser Gly Cys Leu Ser Asn Leu 
115 120 125 

His Asp Ser Ser His Lys Gin Arg Lys Lys Val He Met Arg Ala Phe 
130 135 140 

Ser Arg Glu Ala Leu Glu Cys Tyr Val Pro Val lie Thr Glu Glu Val 
145 150 155 160 

Gly Ser Ser Leu Glu Gin Trp Leu Ser Cys Gly Glu Arg Gly Leu Leu 

165 170 175 

Val Tyr Pro Glu Val Lys Arg Leu Met Phe Arg He Ala Met Arg He 

180 135 190 

Leu Leu Gly Cys Glu Pro Gin Leu Ala Gly Asp Gly Asp Ser Glu Gin 
195 200 205 

Gin Leu Val Glu Ala Phe Glu Glu Met Thr Arg Asn Leu Phe Ser Leu 
210 215 220 

Pro He Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly Met Lys Ala Arg 
225 230 235 240 

Asn Leu He His Ala Arg He Glu Gin Asn He Arg Ala Lys He Cys 

245 250 255 

Gly Leu Arg Ala Ser Glu Ala Gly Gin Gly Cys Lys Asp Ala Leu Gin 

260 265 270 

Leu Leu He Glu His Ser Trp Glu Arg Gly Glu Arg Leu Asp Met Gin 
275 280 285 

Ala Leu Lys Gin Ser Ser Thr Glu Leu Leu Phe Gly Gly His Glu Thr 
290 295 300 

Thr Ala Ser Ala Ala Thr Ser Leu He Thr Tyr Leu Gly Leu Tyr Pro 
305 310 315 320 

His Val Leu Gin Lys Val Arg Glu Glu Leu Lys Ser Lys Gly Leu Leu 

325 330 335 

Cys Lys Ser Asn Gin Asp Asn Lys Leu Asp Met Glu He Leu Glu Gin 

340 345 350 



PAGE 23/49 ' RCVD AT 4/29/2004 2:04:25 PM [Eastern Daylight Time] ' SVR:USPT0-EFXRF-1/3 * DNIS:8729306 * CSID:416 865 7380 < DURATION (mra-ss):12-32 



APR-29-2004 14=13 



TORYS LLP TORONTO 



416 865 7380 



P. 29 



- 8/28 - 



Leu Lys Tyr lie Gly Cys Val lie Lys Glu Thr Leu Arg Leu Asn Pro 
355 360 365 

Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn 
370 375 380 

Gly Tyr Gin lie Pro Lys Gly Trp Asn Val lie Tyr Ser He Cys Asp 
385 390 395 400 

Thr His Asp Val Ala Glu He Phe Thr Asm Lys Glu Glu Phe Asn Pro 

405 410 415 

Asp Arg Phe Ser Ala Pro His Pro Glu Asp Ala Ser Arg Phe Ser Phe 

420 425 430 

He Pro Phe Gly Gly Gly Leu Arg Ser Cys Val Gly Lys Glu Phe Ala 
435 440 445 

Lys He Leu Leu Lys He Phe Thr Val Glu Leu Ala Arg His Cys Asp 
450 455 460 

Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 480 

Tyr Pro Val Asp Asn Leu Pro Ala Arg Phe Thr His Phe His Gly Glu 

485 490 495 

He 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 94 bas© pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 5 



ATG GGG CTC CCG GCG CTG CTG GCC AGT GCG CTC TGC ACC TTC GTG CTG 
Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 
15 10 15 



48 



CCG CTG CTG CTC TTC CTG GCT GCG ATC AAG CTC TGG GAG CTG TAC TGC 
Pro Leu Leu Leu Phe Leu Ala Ala He Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 



96 



GTG AGC GGC CGC GAC CGC AGT TGT GCC CTC CCA TTG CCC CCC GGG ACT 
Val Ser Gly Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 



144 



ATG GGC TTC CCC TTC TTT GGG GAA ACC TTG CAG ATG GTA CTG CAG CGG 
Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 



192 
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AGG AAG TTC CTG CAG ATG AAG CGC AGG AAA TAC GGC TTC ATC TAC AAG 240 
Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe lie Tyr Lys 
65 70 75 90 

ACG CAT CTG TTC GGG CGG CCC ACC GTA CGG GTG ATG GGC GCG GAC AAT 288 
Thr His Leu Phe Gly Arg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

GTG CGG CGC ATC TTG CTC GGA GAC GAC CGG CTG GTG TCG GTC CAC TGG 336 
Val Arg Arg lie Leu Leu Gly Asp Asp Arg Leu Val Ser Val His Trp 

100 105 110 

CCA GCG TCG GTG CGC ACC ATT CTG GGA TCT GGC TGC CTC TCT AAC CTG 384 
Pro Ala Ser Val Arg Thr lie Leu Gly Ser Gly Cys Leu Ser Asn Leu 
115 120 125 

CAC GAC TCC TCG CAC AAG CAG CGC AAG AAG GTG ATT ATG CGG GCC TTC 4 32 

His Asp Ser Ser His Lys Gin Arg Lys Lys Val He Met Arg Ala Phe 
130 135 140 

AGC CGC GAG GCA CTC GAA TGC TAC GTG CCG GTG ATC ACC GAG GAA GTG 480 
Ser Arg Glu Ala Leu Glu Cys Tyr Val Pro Val lie Thr Glu Glu Val 
145 150 155 160 

GGC AGC AGC CTG GAG CAG TGG CTG AGC TGC GGC GAG wGC GGC CTC CTG 529 
Gly Ser Ser Leu Glu Gin Trp Leu Ser Cys Gly Glu Arg Gly Leu Leu 

165 170 175 

GTC TAC CCC GAG GTG AAG CGC CTC ATG TTC CGA ATC GCC ATG CGC ATC 57 6 

Val Tyr Pro Glu Val Lys Arg Leu Met Phe Arg He Ala Met Arg lie 

160 185 190 

CTA CTG GGC TGC GAA CCC CAA CTG GCG GGC GAC GGG GAC TCC GAG CAG 624 
Leu Leu Gly Cys Glu Pro Gin Leu Ala Gly Asp Gly Asp Ser Glu Gin 
195 200 205 

CAG CTT GTG GAG GCC TTC GAG GAA ATG ACC CGC AAT CTC TTC TCG CTG 672 
Gin Leu Val Glu Ala Phe Glu Glu Met Thr Arg Asn Leu Phe Ser Leu 
210 215 220 

CCC ATC GAC GTG CCC TTC AGC GGG CTG TAC CGG GGC ATG AAG GCG CGG 720 
Pro He Asp Val Pro Ph« Ser Gly Leu Tyr Arg Gly Met Lys Ala Arg 
225 230 235 240 

AAC CTC ATT CAC GCG CGC ATC GAG CAG AAC ATT CGC GCC AAG ATC TGC 76B 
Asn Leu He His Ala Arg He Glu Gin Asn He Arg Ala Lys He Cys 

245 250 255 

GGG CTG CGG GCA TCC GAG GCG GGC CAG GGC TGC AAA GAC GCG CTG CAG 816 
Gly Leu Arg Ala Ser Glu Ala Gly Gin Gly Cys Lys Asp Ala Leu Gin 

260 265 270 

CTG TTG ATC GAG CAC TCG TGG GAG AGG GGA GAG CGG CTG GAC ATG CAG 864 
Leu Leu He Glu His S*r Trp Glu Arg Gly Glu Arg Leu Asp Met Gin 
275 280 285 
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GCA CTA AAG CAA TCT TCA ACC GAA CTC CTC TTT GGA GGA CAC GAA ACC 912 
Ala Leu Lys Gin Ser Ser Thr Glu L@u Leu Phe Gly Gly His Glu Thr 
290 295 300 

ACG GCC AGT GCA GCC ACA TCT CTG ATC ACT TAC CTG GGG CTC TAC CCA 960 
Thr Ala Ser Ala Ala Thr Ser Leu He Thr Tyr Leu Gly Leu Tyr Pro 
305 310 315 320 

CAT GTT CTC CAG AAA GTG CGA GAA GAG CTG AAG AGT AAG GGT TTA CTT 1008 
His Val Lea Gin Lys Val Arg Glu Glu Leu Lys Ser Lys Gly Leu Leu 

325 330 335 

TGC AAG AGC AAT CAA GAC AAC AAG TTG GAC ATG GAA ATT TTG GAA CAA 1056 
Cys Lys Ser Asn Gin Asp Asn Lys Leu Asp Met Glu He Leu Glu Gin 

340 345 350 

CTT AAA TAC ATC GGG TGT GTT ATT AAG GAG ACC CTT CGA CTG AAT CCC 1104 
Leu Lys Tyr He Gly Cys Val He Lys Glu Thr Leu Arg Leu Asn Pro 
355 360 365 

CCA GTT CCA GGA GGG TTT CGG GTT GCT CTG AAG ACT TTT GAA TTA AAT 1152 
Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn 
370 375 380 

GGA TAC CAG ATT CCC AAG GGC TGG AAT GTT ATC TAC AGT ATC TGT GAT 1200 
Gly Tyr Gin He Pro Lys Gly Trp Asn Val He Tyr Ser He Cys Asp 
365 390 395 400 

ACT CAT GAT GTG GCA GAG ATC TTC ACC AAC AAG GAA GAA TTT AAT CCT 124 8 

Thr His Asp Val Ala Glu He Phe Thr Asn Lys Glu Glu Phe Asn Pro 

405 410 415 

GAC CGA TTC AGT GCT CCT CAC CCA GAG GAT GCA TCC AGG TTC AGC TTC 12 96 

Asp Arg Phe Ser Ala Pro His Pro Glu Asp Ala Ser Arg phe Ser Phe 

420 425 430 

ATT CCA TTT GGA GGA GGC CTT AGG AGC TGT GTA GGC AAA GAA TTT GCA 134 4 

He Pro Phe Gly Gly Gly Leu Arg Ser Cys Val Gly Lys Glu Phe Ala 
435 440 445 

AAA ATT CTT CTC AAA ATA TTT ACA GTG GAG CTG GCC AGG CAT TGT GAC 1392 
Lys He Leu Leu Lys He Phe Thr Val Glu Leu Ala Arg His Cys Asp 
450 455 460 

TGG CAG CTT CTA AAT GGA CCT CCT ACA ATG AAA ACC AGT CCC ACC GTG 14 40 

Trp Gin Leu Leu Asn Gly Pro Pro Thr Met Lys Thr Ser Pro Thr Val 
465 470 475 480 

TAT CCT GTG GAC AAT CTC CCT GCA AGA TTC ACC CAT TTC CAT GGG GAA 14 BB 

Tyr Pro Val Asp Asn Leu Pro Ala Arg Phe Thr His Phe His Gly Glu 

485 490 495 

ATC TGA 1494 
He 
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(2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 aming acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Pro Phe Gly Gly Gly Pro Arg Leu Cys Pro Gly Tyr Glu Leu Ala Arg 

1 5 10 15 

Val Ala Leu Ser 

20 



(2) INFORMATION FOR SEQ ID NO; 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

Pro Phe Ser Gly Gly Ala Arg Asn Cys He Gly Lys Gin Phe Ala Met 
1 5 10 15 

Ser Glu Met Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH; 20 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
Pro Phe Ser Gly Gly Ala Arg Asn Cys He Gly Lys Gin Phe Ala Met 
1 5 10 15 

Asn Glu Leu Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 amino acids 
CB) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOi9 

Pro Phe Gly Thr Gly Pro Arg Asn Cys He Gly Met Arg Phe Ala He 
15 10 15 

Met Asn Met Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Pro Phe Ser Gly Gly Ser Arg Asn Cys He Gly Lys Gin Phe Ala Met 
15 10 15 

Asn Glu Leu Lys 

20 



(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 
£C) STRANDEDNESS i single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



GAACTCCTCT TTGGAGGACA CGAAACCACG GCCAGTGCAG CCACATCTCT GATCACTTAC 60 

CTGGGGCTCT ACCCACATGT TCTCCAGAAA GTGCGAGAAG AGCTGAAGAG TAAGGGTTTA 120 

CTTTGCAAGA GCAATCAAGA CAACAAGTTG GACATGGAAA TTTTGGAACA ACTTAAATAC 180 

ATCGGGTGTG TTATTAAGGA GACCCTTCGA CTGAATCCCC CAGTTCCAGG AGGGTTTCGG 240 

GTTGCTCTGA AGACTTTTGA ATTAAATGGA TACCAGATTC CCAAGGGCTG GAATGTTATC 300 

TACAGTATCT GTGATACTCA TGATGTGGCA GAGATCTTCA CCAACAAGGA A 351 



(2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
TTTTTTTTTT TTGG 14 
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(Z) INFORMATION FOR SEQ ID NO: 13 

<i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 14 base pairs 

(B) TYPE r nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
TTTTTTTTTT ttgA r 14 



(2) INFORMATION FOR SEQ It) NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

TTTTTTTTTT TTGT 14 



(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TTTTTTTTTT TTGC 14 



(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 16 



(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TTTTTTTTTT TTAA 14 



(2) INFORJ4ATION FOR SEQ ID NO: 18 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
TTTTTTTTTT TTAT 14 



(2) INFORMATION FOR SEQ ID NO: 19 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
TTTTTTTTTT TTAC 14 



(2) INFORMATION FOR SEQ ID NO: 20 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
(D) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
TTTTTTTTTT TTCG 14 



(2} INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY i linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

TTTTTTTTTT TTCA 14 
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(2) INFORMATION FOR SEQ IDNO:22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
TTTTTTTTTT TTCT 



{2} INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
TTTTTTTTTT TTCC 14 



(2) INFORMATION FOR SEQ ID NO: 24 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
AAGCGACCGA 10 



(2) INFORMATION FOR SEQ ID NO; 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 25 
TGTTCGCCAG 10 



(2) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid 
{C) STRANDEDNESS; single 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: $EQ ID NO: 26 
TGCCAGTGGA 



(2) INFORMATION FOR SEQ ID NO: 27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GGCTGCAAAC 



(2) INFORMATION FOR SEQ ID NO:28 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CCTAGCGTTG 



(2) INFORMATION FOR SEQ ID NO: 2 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GTAGCGGCCG CTGCCAGTGG A 21 



(2) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 12 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GTAGCGGCCG CT 12 
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(2) INFORMATION FOR SEQ ID NO: 31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1725 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS; single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 31 

GCACGAGGGA GGCTGAAGCG TGCC ATG GGG CTC CCG GCG CTG CTG GCC AGT 51 

Met Gly Leu Pro Ala Leu Leu Ala Ser 
1 5 

GCG CTC TGC ACC TTC GTG CTG CCG CTG CTG CTC TTC CTG GCG GCG CTC 99 
Ala Leu Cys Thr Phe Val Leu Pro Leu Leu Leu Phe Leu Ala Ala Leu 
10 15 20 25 

AAG CTC TGG GAC CTG TAC TGT GTG AGC AGC CGC GAT CGC AGC TGC GCC 147 
Lys Leu Trp Asp Leu Tyr Cys Val Ser Ser Arg Asp Arg Ser Cys Ala 

30 35 40 

CTC CCC TTG CCC CCC GGT ACC ATG GGC TTC CCA TTC TTT GGG GAA ACA 195 
Leu Pro Leu Pro Pro Gly Thr Met Gly Phe Pro Phe Phe Gly Glu Thr 

45 50 55 

TTG CAG ATG GTG CTT CAG CGG AGG AAG TTT CTG CAG ATG AAG CGC AGG 243 
Leu Gin Met Val Leu Gin Arg Arg Lys Phe Leu Gin Met Lys Arg Arg 
60 65 70 

AAA TAC GGC TTC ATC TAC AAG ACG CAT CTG TTT GGG CGG CCC ACG GTG 291 
Lys Tyr Gly Phe lie Tyr Lys Thr His Leu Phe Gly Arg Pro Thr Val 
75 80 85 

CGG GTG ATG GGC GCG GAT AAT GTG CGG CGC ATC TTG CTG GGA GAG CAC 339 
Arg Val Met Gly Ala Asp Asn Val Arg Arg He Leu Leu Gly Glu His 
90 95 100 105 

CGG TTG GTG TCG GTG CAC TGG CCC GCG TCG GTG CGC ACC ATC CTG GGC 387 
Arg Leu Val Ser Val His Trp Pro Ala Ser Val Arg Thr lie Leu Gly 

110 115 120 

GCT GGC TGC CTC TCC AAC CTG CAC GAT TCC TCG CAC AAG CAG CGA AAG 435 
Ala Gly Cys Leu Ser Asn Leu His Asp Ser Ser His Lys Gin Arg Lya 

125 130 135 

AAG GTG ATT ATG CAG GCC TTC AGC CGC GAG GCA CTC CAG TGC TAC GTG 4 83 

Lys Val lie Met Gin Ala Phe Ser Arg Glu Ala Leu Gin Cys Tyr Val 
140 145 150 

CTC GTG ATC GCT GAG GAA GTC AGC AGT TGT CTG GAG CAG TGG CTA AGC 531 
Leu Val lie Ala Glu Glu Val Ser Ser Cys Leu Glu Gin Trp Leu Ser 
155 160 165 

TGC GGC GAG CGC GGC CTC CTG GTC TAC CCC GAG GTG AAG CGC CTC ATG 579 
Cys Gly Glu Arg Gly Leu Leu Val Tyr Pro Glu Val Lys Arg Leu Met 
170 175 180 185 
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TTC CGC ATC GCC ATG CGC ATC CTG CTG GGC TGC GAG CCG GGT CCA GCG 627 

Phe Arg lie Ala Met Arg lie Leu Leu Gly Cys Glu Pro Gly Fro Ala 

190 195 200 

GGC GGC GGG GAG GAC GAG CAA CAG CTC GTG GAG GCT TTC GAG GAG ATG 675 

Gly Gly Gly Glu Asp Glu Gin Gin Leu Val Glu Ala Phe Glu Glu Met 

205 210 215 

ACC CGC AAT CTC TTC TCT CTT CCC ATT GAC GTG CCC TTT AGC GGC CTG 723 

Thr Arg Asn Leu Phe Ser Leu Pro He Asp Val Pro Phe Ser Gly Leu 
220 225 230 

TAC CGG GGC GTG AAG <3CG CGG AAC CTT ATA CAC GCG CGC ATC GAG GAG 771 

Tyr Arg Gly Val Lys Ala Arg Asn Leu lie His Ala Arg He Glu Glu 
235 240 245 

AAC ATT CGC GCC AAG ATC CGC CGG CTT CAG GCT ACA GAG CCG GAT GGG 619 

Asn He Arg Ala Lys He Arg Arg Leu Gin Ala Thr Glu Pro Asp Gly 

250 255 260 265 

GGT TGC AAG GAC GCG CTG CAG CTC CTG ATT GAG CAC TCG TGG GAG AGG 867 

Gly Cys Lys Asp Ala Leu Gin Leu Leu He Glu His Ser Trp Glu Arg 

270 * 275 280 

GGA GAG AGG CTG GAT ATG CAG GCA CTA AAA CAA TCG TCA ACA GAG CTC 915 

Gly Glu Arg Lgu Asp Met Gin Ala Leu Lys Gin Ser Ser Thr Glu Leu 

265 290 295 

CTC TTT GGT GGT CAT GAA ACT ACA GCC AGT GCT GCG ACA TCA CTG ATC 963 

Leu Phe Gly Gly His Glu Thr Thr Ala Ser Ala Ala Thr Ser Leu He 
300 305 310 

ACT TAC CTA GGA CTC TAC CCA CAT GTC CTC CAG AAA GTT CGA GAA GAG 1011 

Thr Tyr Leu Gly Leu Tyr Pro His Val Leu Gin Lys Val Arg Glu Glu 
315 320 325 

ATA AAG AGC AAG GGC TTA CTT TGC AAG AGC AAT CAA GAC AAC AAG TTA 1059 
He Lys Ser Lys Gly Leu Leu Cys Lys Ser Asn Gin Asp Asn Lys Leu 

330 335 340 345 

GAC ATG GAA ACT TTG GAA CAG CTT AAA TAC ATT GGG TGT GTC ATT AAG 1107 

Asp Met Glu Thr Leu Glu Gin Leu Lys Tyr He Gly Cys Val He Lys 

350 355 360 

GAG ACC CTG CGA TTG AAT CCT CCG GTT CCA GGA GGG TTT CGG GTT GCT 1155 

Glu Thr Leu Arg Leu Asn Pro Pro Val Pro Gly Gly Phe Arg Val Ala 

365 370 375 

CTG AAG ACT TTT GAG CTG AAT GGA TAC CAG ATC CCC AAG GGC TGG AAT 1203 

Leu Lys Thr Phe Glu Leu Asn Gly Tyr Gin He Pro Lys Gly Trp Asn 
380 385 390 

GTT ATT TAC AGT ATC TGT GAC ACC CAC GAT GTG GCA GAT ATC TTC ACT 1251 

Val He Tyr Ser He Cys Asp Thr His Asp Val Ala Asp He Phe Thr 
395 400 405 
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AAC AAG GAG GAA TTT AAT CCC GAC CGC TTT ATA GTG CCT CAT CCA GAG 1299 
Asn Lys Glu Glu Phe Asn Pro Asp Arg Phe He Val Pro His Pro Glu 
410 415 420 425 

GAT GCT TCC CGG TTC AGC TTC ATT CCA TTT GGA GGA GGC CTT CGG AGC 1347 
Asp Ala Ser Arg Phe Ser Phe He Pro Phe Gly Gly Gly Leu Arg Ser 

430 435 440 

TGT GTA GGC AAA GAG TTT GCA AAA ATT CTT CTT AAG ATA TTT ACA GTG 1395 
Cys Val Gly Lys Glu Phe Ala Lys He Leu Leu Lys He Phe Thr Val 

445 450 455 

GAG CTG GCT AGG CAC TGT GAT TGG CAG CTT CTA AAT GGA CCT CCT ACA 144 3 

Glu Leu Ala Arg His Cys Asp Trp Gin Leu Leu Asn Gly Pro Pro Thr 
460 465 470 

ATG AAG ACA AGC CCC ACT GTG TAC CCT GTG GAC AAT CTC CCT GCA AGA 14 91 

Met Lys Thr Ser Pro Thr Val Tyr Pro Val Asp Asn Leu Pro Ala Arg 
475 480 485 

TTC ACC TAC TTC CAG GGA GAT ATC TGATAGCTAT TTCAATTCTT 1535 
Phe Thr Tyr Phe Gin Gly Asp He 
490 495 

GGACTTATTT GAAGTGTATA TTGGTTTTTT TTAAAAATAG TGTCATGTTG ACTTTATTTA 1595 

ATTTCTAAAT GTATAGTATG ATATTTATGT GTCTCTACTA CAGTCCCGTG GTCTTTAAAT 1655 

ATTAAAATAA TGAATTTGTA TGATTTCCCA ATAAAGTAAA ATTAAAAAGT GAAAAAAAAA 1715 

AAAAAAAAAA 1725 



(2) INFORMATION FOR SEQ ID NO: 32 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 97 amino acids 

(B) TYPE: amino acid 

fC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

Met Gly Leu Pro Ala Leu Leu Ala Ser Ala Leu Cys Thr Phe Val Leu 
1 5 10 15 

Pro Leu Leu Leu Phe Leu Ala Ala Leu Lys Leu Trp Asp Leu Tyr Cys 

20 25 30 

Val Ser Ser Arg Asp Arg Ser Cys Ala Leu Pro Leu Pro Pro Gly Thr 
35 40 45 

Met Gly Phe Pro Phe Phe Gly Glu Thr Leu Gin Met Val Leu Gin Arg 
50 55 60 

Arg Lys Phe Leu Gin Met Lys Arg Arg Lys Tyr Gly Phe He Tyr Lys 
65 70 75 80 
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Thr His Leu Phe Gly Arg Pro Thr Val Arg Val Met Gly Ala Asp Asn 

85 90 95 

Val Arg Arg He Leu Leu Gly Glu His Arg Leu Val Ser Val His Trp 

100 105 HO 

Pro Ala Ser Val Arg Thr He Leu Gly Ala Gly Cys Leu Ser Asn Leu 
115 120 125 

His Asp Ser Ser His Lys Gin Arg Lys Lys Val He Met Gin Ala Phe 
130 135 140 

Ser Arg Glu Ala Leu Gin Cys Tyr Val Leu Val He Ala Glu Glu Val 
145 150 155 160 

Ser Ser Cys Leu Glu Gin Trp Leu Ser Cys Gly Glu Arg Gly Leu Leu 

165 170 175 

Val Tyr Pro Glu Val Lys Arg Leu Met Phe Arg He Ala Met Arg He 

180 185 190 

Leu Leu Gly Cys Glu Pro Gly Pro Ala Gly Gly Gly Glu Asp Glu Gin 

195 200 205 

Gin Leu Val Glu Ala Phe Glu Glu Met Thr Arg Asn Leu Phe Ser Leu 
210 215 220 

Pro He Asp Val Pro Phe Ser Gly Leu Tyr Arg Gly Val Lys Ala Arg 
225 230 235 240 

Asn Leu He His Ala Arg He Glu Glu A3n He Arg Ala Lys He Arg 

245 250 255 

Arg Leu Gin Ala Thr Glu Pro Asp Gly Gly Cys Lys Asp Ala Leu Gin 

260 265 270 

Leu Leu He Glu His Ser Trp Glu Arg Gly Glu Arg Leu Asp Met Gin 
275 280 265 

Ala Leu Lys Gin Ser Ser Thr Glu Leu Leu Phe Gly Gly His Glu Thr 
290 295 300 

Thr Ala Ser Ala Ala Thr Ser Leu He Thr Tyr Leu Gly Leu Tyr Pro 
305 310 315 320 

His Val Leu Gin Lys Val Arg Glu Glu He Lys Ser Lys Gly Leu Leu 

325 330 335 

Cys Lys Ser Asn Gin Asp Asn Lys Leu Asp Met Glu Thr Leu Glu Gin 

340 345 350 

Leu Lys Tyr He Gly Cys Val He Lys Glu Thr Leu Arg Leu Asn Pro 
355 360 365 



Pro Val Pro Gly Gly Phe Arg Val Ala Leu Lys Thr Phe Glu Leu Asn 
370 375 380 
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Gly Tyr Gin lie Pro Lys Gly Trp Asn Val lie Tyr Ser lie Cys Asp 

385 390 395 400 

Thr His Asp Val Ala Asp lie Phe Thr Asn Lys Glu Glu Phe Asn Pro 

405 410 415 



Asp Arg Phe He Val Pro His Pro 

420 

He Pro Phe Gly Gly Gly Leu Arg 
435 440 

Lys He Leu Leu Lys He Phe Thr 
450 455 

Trp Gin Leu Leu Asn Gly Pro Pro 
465 470 

Tyr Pro Val Asp Asn Leu Pro Ala 

485 



Glu Asp Ala Ser Arg Phe Ser Phe 
425 430 

Ser Cys Val Gly Lys Glu Phe Ala 

445 

Val Glu Leu Ala Arg His Cys Asp 

460 

Thr Met Lys Thr Ser Pro Thr Val 
475 480 

Arg Phe Thr Tyr Phe Gin Gly Asp 
490 495 



He 



(2) INFORMATION FOR SEQ ID NO:33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 1Q NO: 33 



CGCACCCCAG GAGGCGCGCT CGGAGGGAAG CCGCCACCGC CGCCGCCTCT GCCTCGGCGC 60 

GGAACAAACG GTTAAAGATT TTGGGCCASC GCCTCCGCGG GGGGAGGAGC CAGGGGCCCC 120 

AATCCCGCAA TTAAAGATGA ACTTTGGGTG AACTAATTGT CTGACCAAGG TAACGTGGGC 180 

AGCAACCTGG GCCGCCTATA AAGCGGCAGC GCCGTGGGGT TTGAAGCGCT GGCGGCGGCG 240 

GCAGGTGGCG CGGGAGGTCG CGGCGCGCCA TGG 273 



(2) INFORMATION FOR SEQ ID NOt34 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 
(c) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

CGCACCCCCA GGAGGCGCGC TCAGAGGGAA GCCGCCAGTG CGCCGCCTCT GCCTCGGCGC 60 

GGAACAAACG GTTAAAGATT TTTTTGGGCA GCGCCTCGAG GGGGGAGGAG CCAGGGGCCC 120 
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GATCCGCAAT TAAAGATGAA CTTTGGGTGA ACTAATTTGT CTGACCAAGG TAACGTGGGC 
AGTAACCTGG G.CG6CCTTAT AAAGAGGGCG CGCGGCGGGG TTCGGAGCTA GGGAGGCGGC 
GGCAGGTGGC GCGGGAGGCT GAAGCGTGCC ATGG 



(2) INFORMATION FOR SEQ ID NO: 35 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 



GTTGCGCGCT TCAACATGG 



(2) INFORMATION FOR SEQ ID NO; 36 

(i) SEQUENCE. CHARACTERISTICS: 

(A) LENGTH: 2677 base pairs 
O) TYPE: nucleic acid 

(C) STfcANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 

iao 

240 
300 

319 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
TCGGGGGAAT TAACACCTTT TCAAAGTGAA ATCTCAGGAT TGTCTGCCTT CTACAGGAGG 
TGGTATTAAA ATGCGCCTAT AACAAATGGT TGAGAGTTTG GAGCCGCTTC TGCCCTGTGG 
GCGGGGCGAG ATGACACCAC AATTAAAGAT GAACTTTGGG TGAACTAATT TATCTGAGGA 
AGTTAACAGG AGGAGACCTG CGCGCAATGG ATATATAAGG GCGCGCAGGC GAGGACGCCC 
TCAGTTTGTG CGTAAAGACG CGTCTCCTCT CCAGAAGCTT GTTTTTCGTT TTGGCGATCA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36 

GATCCCAGAT CTGCCTATTG CGCCCGATGC CCCGAGGCTC TCTCTTGGAC TCTGGCCCTG 60 

AGTTCTTCTG CGCGATCCTT CGGAGACGTC TGGAGGCCTG CTTTATGCAT CTCTCTTGGA 120 

CCTCAGTTTC CCCACACGTG GGAGGAGGCA GCTGGACGAT TCCTGAAAGG ACTTTCCCTT 180 

GCTTCCTCAT CACGTGGAAG AGAGCCCACC CGGCACCTGG AAATGGAAAG CCAGTGAAGG 240 

CTGCTTTGGG CCGGGGCAKC GGGTGGGACC GGGCGGGAGG GATTCCAAAG AGACCGCCGG 300 

GAAGGCTAGA GCTTGGAATT CCGGCTCCTC GGAGTCCTGG CCCTCCCCCA CCGCCGCCTC 360 

GGAGCTCAGC ACACCTTGGA TGGGGGAGGC GGGCAGCTCC TAGCCCCGCA CCCCAGGAGG 420 

CGCGCTCGGA GGGAAGCCGC CACCGCCGCC GCCTCTGCCT CGGCGCGGAA CAAACGGTTA 4 90 

AAGATTTTGG GCCASCGCCT CCGCGGGGGG AGGAGCCAGG GGCCCCAATC CCGCAATTAA 540 

AGATGAACTT TGGGTGAAC1" AATTGTCTGA CCAAGGTAAC GTGGGCAGCA ACCTGGGCCG 600 
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CCTATAAAGC GGCAGCGCCG TGGGGTTTGA AGCGCTGGCG GCGGCGGCAG GTGGCGCGGG 660 

AGGTCGCGGC GCGCCATGGG GCTCCCGGCG CTGCTGGCCA GTGCGCTCTG . CACCTTCGTG 720 

CTGCCGCTGC TGCTCTTCCT GGCTGCGATC AAGCTCTGGG ACCTGTACTG CGTGAGCGGC 7 BO 

CGCGACCGCA GTTGTGCCCT CCCATTGCCC CCCGGGACTA TSGGSTTCCC CTTCTTTGGG 840 

GAAACCTTGC AGATGNTACT NCAGGTAAGG GAGGGTGGGG CGGGACAGGC TGCTTCCCCG 900 

GAGCCCGGCG CGGCTCTGGG CTTCTGCTGA AGTCGGGGTA GGCGCCCCCG GGAGGCATGC 960 

TATTGCGGCT AGGAGCAGGG CTGGCGGGAG CGCGGCGCTC CCCGGMKYMC SCTCAWGCSC 1020 

RCWWKTMWCC TCCGCCTYMC TCCCAMAGCG GARSAARWKC YKGMRGATGA AGCGCAGGAA 1080 

ATACGGCTTC ATCTACAAGA CGCATCTGTT CGGGCGGCCC ACCGTACGGG TGATGGGCGC 1140 

GGACAATGTG CGGCGCATCT TGCTCGGAGA GCACCGGCTG GTGTCGGTCC ACTGGCCAGC 1200 

GTCGGTGCGC ACCATTCTGG GATCTGGCTG CCTCTCTAAC CTGCACGACT CCTCGCACAA 1260 

GCAGCGCAAG AAGGTGGGGG CAGGAGGCGA CGGCTGGACA GGGAGGGGGA CCCCATTTAT 1320 

GAGCGGAATT CCGGCTGATG GATGCTAGGC GCGGGCTAGC AGCTTGAGGT GGGCTAGGAC 1380 

CCTCTGCCAG CTCCAGGTTA GCTTTCCCAG CTCGGAGAGT GCCATGTGTC TGGCAGGACT 1440 

GGGGGTGTCT GGAAGGGGAC GGCGGTAGAC GAGAGGGGCG GATGGAGGCT TTTAACGCTG 1500 

TCCCCTCCTC GGGACTCAGG TGATTATGCG GGCCTTCAGC CGCGAGGCAC TCGAATGCTA 1560 

CGTGCCGGTG ATCACCGAGG AAGTGGGCAG CAGCCTGGAG CAGTGGCTGA GCTGCGGCGA 1620 

GCGCGGCCTC CTGGTCTACC CCGAGGTGAA GCGCCTCATG TTCCGAATCG CCATGCGCAT 1690 

CCTACTGGGC TGCGAACCCC AACTGGCGGG CGACGGGGAC TCCGAGCAGC AGCTTGTGGA 1740 

GGCCTTCGAG GAAATGACCC GCAATCTCTT CTCGCTGCCC ATCGACGTGC CCTTCAGCGG 1800 

GCTGTACCGG GTAAGGGCGG CAAACGGGCT GCGGACTAGG GGCGCGGGAC CTGGGCGTCT I8 60 

GCTCACCGCC GCGCGCTCTC TGCGCTCAGG GCATGAAGGC GCGGAACCTC ATTCACGCGC 1920 

GCATCGAGCA GAACATTCGC GCCAAGATCT GCGGGCTGCG GGCATCCGAG GCGGGCCAGG 1980 

GCTGCAAAGA CGCGCTGCAG CTGTTGATCG AGCACTCGTG GGAGAGGGGA GAGCGGCTGG 2040 

ACATGCAGGT GAGTAGCAGC TTCAGACCAG GCACTGCGGA GrTTGGTCCC CTGGCTTTCC 2100 

AAGGCGCTGT TCCTGGGGCC CCCAAAGCGC GCGCCTGGGG CCCAGCTTTC TGGAGTGGGC 2160 

GGCCGGCTCA GACTACAGCT ATGGAATCCC GAAGGAAGGC TGAGACACCC GGTCAGGAGA 2220 

GCTGCGGAAG GGGCTGCGGM GGAAACTGGG AGCATCCCCT AGCCTTTAMC AGGTTTCAAA 2280 
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GGGAAAGTTG GAATTTGCAA AAATGTTAAT AAAGAACCTT GCGATTTTAA TAAAACTAAG 2340 

ACTTTAACTC AGGAGTTTCC GGTAGRGCGG GGTCGTACTC GCCTTACTGC TCCAGCTGAA 2400 

CTAAAGGGAC GTTGCATTTT GTTTAAAGAT ATTGCTTTCC TTGACTTTCT GTCAGCAAAA 24 60 

CATTTAGCCC TTCTAGTCTT CCCTCCAGAA CTCTCAGTTC GATTCTGAGT AATCCTTCTG 2520 

TCAAACCGCA GGCAGACTTG TGAGAATGTG GGTCTCACTC TATTCTTAGG CACTAAAGCA 2580 

ATCTTCAACC GAACTCCTCT TTGGAGGACA CGAAACCACG GCCAGTGCAG CCACATGTCT 264 0 

GATCACTTAC CTGGGGCTCT ACCCACATGT TCTCCAG 2677 



(2) INFORMATION FOR SEQ ID NO: 37 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 



GATCCAGGTT GCTGAAACAT ATCTCCATAT AGGGCAGAAG AATTATCAAA AGCATAAGAA 60 

TTGCAGCCAC AGCATAGGGA AGAAAGAGGA GTTTTTAAAC CACAACAAAA GGGAGAAAGA 120 

AGAGAATTTT AACTTACATT TAATTCAAAA GTCTTCAGAG CAACCCGAAA CCCTCCTGGA 180 

ACTGGGGGAT TCAGTCGAAG GGTCTCCTTA ATAACACACC CGATGTATYT AAGTTGTTCC 240 

AAAATTTCCA TGTCCAACTT GTTGTCTTGA TTGCTCTTGC AAAGTAAACC CTAYCAAAAY 300 

AGTCATACAG AGGTGAACAG TYATTTTGTG CTCCAATTAA AATCAGCCCA GCAGACGTAA 360 

ACAGGGCTTA AGTGGAGACT AAACCCAAAG GGCCCCATGA TGGGAGAGAC TGGGAGGGGG 420 

AAACAGCAGC TAATGGCCAT TTGCCTGCCC AAATCCACTA TCTATTTACA ATCCCAGGAG 480 

AATGCTGCTC ACCAGTTAGA AGGACCAAGT TTCTCCCCAC GCCCCCCCAC CCCACACTCA 54 0 

CCACCACCAC CCACACTAAT CAGCTATTCA CACTATGTAT GCCCTTGGAC ACACCAATTC 600 

AAGAAAAGTG GAACCTATCT GAGAATCTCC ACGGTTCACA AAAAGGTGGA GGAGGGGTAG 660 

GAATACAAGG TCAAACCCTG CCC 683 



(2) INFORMATION FOR SEQ ID NO: 38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4164 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PAGE 45/49 * RCVD AT 4/2912004 2:04:25 PM [Eastern Daylight Time] ' SVR:USPT0-EFXRF-1/3 * DNIS:8729306 * CSID:416 865 7380 * DURATION (mm-ss): 12-32 



APR-29-2004 14=17 TORYS LLP TORONTO 416 865 7380 P. 46 

- 25/28 - 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 

TCGCGAGGAG CGACCACGGC TTGAAGAGGG GTAGACGAGA CCAGATGCTC CCCGGCGCCC 60 

CCTCATGCGG GTTGCGGTCT CTCTCCTCCA CCTCCCTCTC AGCGGAGGAA GTTTCTGCA.G 120 

ATGAAGCGCA GGAAATACGG CTTCATCTAC AAGACGCATC TGTTTGGGCG GCCCACGGTG 1B0 

CGGGTGATGG GCGCGGATAA TGTGCGGCGC ATCTTGCTGG GAGAGCACCG GTTGGTGTCG 240 

GTGCACTGGC CCGCGTCGGT GCGCACCATC CTGGGCGCTG GCTGCCTCTC CAACCTGCAC 300 

GATTCCTCGC ACAAGCAGCG AAAGAAGGTG AGGGTGAGCT GGCAACTCCT TGGCTGGCAG 360 

GGAGACCTCA TCCTATGGCT TGGTTCAGGC AAAATAGAAT GCGGGGCGAG GGCTAGTCCT 420 

ATGTGGTGGG GACCAGGACC CTCTCTATCT GAGATCCACT TTAGCTTTTC TGCTAGCACG 480 

TGGGTTAGTC CTGGGGGGGA CTGAAATTCT TGAAAGGGTA CTCGGAAAGG CGAAGGGGGG 540 

GGGGCTGAGG GAAAGTAGAG GATTGTAACA CTCTCTGCTC CTGGGGGGTG CTCAGGTGAT 600 

TATGCAGGCC TTCAGCCGCG AGGCACTCCA GTGCTACGTG CCCGTGATCG CTGAGGAAGT 660 

CAGCAGTTGT CTGGAGCAGT GGCTAAGCTG CGGCGAGCGC GGCCTCCTGG TCTACCCCGA 720 

GGTGAAGCGC CTCATGTTCC GCATCGCCAT GCGCATCCTG CTGGGCTGCG AGCCGGGTCC 780 

AGCGGGCGGC GGGGAGGACG AGCAGCAGCT CGTGGAGGCT TTCGAGGAGA TGACCCGCAA 840 

TCTCTTCTCT CTTCCCATTG ACGTGCCCTT TAGCGGCCTG TACCGGGTAA GGGCGGTTTG 900 

CGGAGTCGGA GTAGGGGAAC GCAAGCTCGG GCATCCGCTC ACCGCCACGC TCTCTCCGCG 960 

CTCAGGGCGT GAAGGCGCGG AACCTTATAC ACGCGCGCAT CGAGGAGAAC ATTCGCGCCA 1020 

AGATCCGCCG GCTTCAGGCT ACAGAGCCGG ATGGGGGTTG CAAGGACGCG CTGCAGCTCC 1080 

TGATTGAGCA CTCGTGGGAG AGGGGAGAGA GGCTGGATAT GCAGGTGAGA AGCAATTTCA 1140 

AAAGGTGCCA AGGGCCGGGG AGTGCCTCTG ACTTTCCAGA CACACTTTCT GGGGTCTCCA 1200 

AAGCCCTGTC AAGGCCCCAG CTACTTCCAA GTGGGCGGCG ATGCTAGGTC TAGAGCTTTT 1260 

CAACCTGTGG GTCGTGACCC CTTCACGGAG CCAAACAACC GTTTCAGAAG GGTCGCCTAA 1320 

GAGCATCTGC ATATCCGATA TTTACATCAA GAAACATAAC AGTAGCAAAA TTACCGTTAT 1380 

GAAGTAGCAA CAAAGATAAT TTTATCGTTG GGGGTCACCA CAACACGAGG AACCGTATTA 14 4 0 

AAGGGTGGCA TTGGTCTAGA GAGCTGTGGA AGGGGGTGGC TGAGCAATGG GGAAGATCCC 1500 

AAAGTTCAAA GGGCAAGGCT CATCTACAAA GGTTAAAGCG GAAGAGCAGG ATTAAGGGAG 1560 

TTTTGCGTTT TTGTTTGTGG TCTTTGACTT TCTATGAACA AAACGGATTT TACCCTTGAA 1620 

GTCTTCCGTG CAATATTCTC AGGTCAGGTC TTTGTAACAG TGCTATAAAC TGCACTCAGA 1680 
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TCTGTATAAA CTTCCGTTTT TATCCTTAGG CACTAAAACA ATCGTCAACA GAGCTCCTCT 1740 

TTGGTGGTCA TGAAACTACA GCCAGTGCTG CGACGTCACT GATCACTTAC CTAGGACTCT 1900 

ACCCACATGT CCTCCAGAAA GTTCGAGAAG AGATAAAGAG CAAGGTAGGA TGATTCTAGA IB 60 

GGTTCCCCAT TTGCCTAGGA CATTCCTCTA TTAACCACCA CCACCACCCC CXCTGTATAT 1920 

AAGTTTGCTC GATACACCCA GTACTATGAC AGTGAAGATC TGAGAGCTAG GTGGGACTGT . 1990 

GGGGGAGAGA CTCCACCTCG TGAATTTAAA AAGGCAGTTG TTTGTACTGG GCTCTCTCTT 2040 

GGGCAGAATT TGACCCTCTC CTCCTCCTCC TCCTCCTCCT CCTCTTCCTC CTCCACCACC 2100 

ACCACCATCA CCACCTTTTA TAGAGCAAGG TTCTCCTTTC CCTGACCAAG AACATGAATA 2160 

ATGTGATTAG AGCCAATAGC TGATCAGGGT CGCAGTGTTG GTGAGGGCTC AGGGTATGAC 2220 

CCTTTATATA CCTGATAAGC AACATTGTCT GGATAATGGG TTTAGGCTGA GGAAGTGTGG 2280 

AAAGGAAGGC CATCAGGCCA TCAGCTCTTT CCCTTTTATC CTCTCCCATC CAGACGCCTT 2340 

CAGGTTTAGT TAACAGGTGA GTCCTGCTGG GCTGACTTTT TTTTTGGAGT GCCCAGGGAT 2400 

CCATCACTCA CTTTTTTATC TGTTTCCATA GGGCTTACTT TGCAAGAGCA ATCAAGACAA 2460 

CAAGTTAGAC ATGGAAACTT TGGCACAGCT TAAATACACT GGGTGTGTCA TTAAGGAGAC 2520 

CCTGCGATTG AATCCTCCGG TTCCAGGAGG GTTTCGGGTT GCTCTGAAGA CTTTTGAGCT 2580 

GAATGTGAGT GCACCTCCTG TCCCCCACCC CCAGCCCTCG TCCACGTCCA CTCTGCTATG 2 640 

CTGTTGAGCA TCAGCTGCCC AGAGCAGTGG CTCACTGCCC TTGACAGTGT CCTGCCTCCT 2700 

ATGGTACTGG GAACCAATTT GCTCTCCTCT CTTAATGCCA TCCATGCTAG TAATGACTTT 2760 

TTGTTGTTGC AAGCTCAGGG CCGGGATTGT CAATTCTTAG GATTTTTTTT TTTTTTTAAA 2820 

CAGGGATACC AGATCCCCAA GGGCTGGAAT GTTATTTACA GTATCTGTGA CACCCACGAT 2S80 

GTGGCAGATA TCTTCACTAA CAAGGAGGAA TTTAATCCCG ACCGCTTTAT AGTGCCTCAT 2940 

CCAGAGGATG CTTCCCGGTT CAGCTTCATT CCATTTGGAG GAGGCCTTCG GAGCTGTGTA 3000 

GGCAAAGAGT TTGCAAAAAT TCTTCTTAAG ATATTTACAG TGGAGCTGGC TAGGCACTGT 3060 

GATTGGCAGC TTCTAAATGG ACCTCCTACA ATGAAGACAA GCCCCACTGT GTACCCTGTG 3120 

GACAATCTCC CTGCAAGATT TACCCACTTC CAGGGAGATA TCTGATAGCT ATTTCAATTC 3180 

TTGGACTTAT TTGAAGTGTA TATTGTTTTT TTTAAAATAG TGTCATGTTG ACTTTATTTA 3240 

ATTTCTAAAT GTATAGTATG ATATTTATGT GTCTCTACTA CAGTCCCGTG GTCTTAAATA 3300 

TTAAAATAAT GAATTTGTAT GATTTCCCAA TAAAGTAAAA TTAAAAAGTG CTTCTCTTGC 3360 
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TTTTTAAGAT TCTTGTTGGC AAGCTGCCCA TGGTGGTACA TTGCTGTAAT ACTAGGACTT 3420 

GGAAGGTGGA GGCAAGAAGA GCAGGCATTC AAGGCTAGCC TGGGCTACAG AAATCCTGTC 34 80 

TTAAACAAAC ACTACAACAA AAAGTCCTGT TAGGGAATCT GACTGGCTCA GTGTTTGTAC 3540 

TTTGTGTATT TAAAATGATT TAGAGTGAAA CCATAGGTCT CTCCCCCATG TCAGAAAATA 3600 

TATATTATTA TGTGTATGCT GATCCAAAGT ATCTTTGTAA CTTTTTCTAA GGTCATTGAG 3660 

ACTTCATATT TTGAAATTGT ATGGAGGCTA GTTATATTAC ATTATTTATT TATTTATTTA 3720 

TTTACATTTT TATGGTGCTG GGGATTGGAT CGAAGGCTTC ACACCTCTAG GGCAAGCCCT 37 80 

TTGTCATTAA GGCGCTGCCT CTCCCTTTCA GCCCAACGTT AATTCTAGAT TCTTTTTCTT 3840 

TGGTGCTTTT GGGAGGTAAA CCTGGGATGC TGCAGTTATT TGGTGGTGGT CGTTGGTTTT 3900 

ACTCTAGAGA GAAGGCAACT TTGGGAAGGC AACACTGCTG CTGGTGAGTC GGGAAGCATC 3960 

ATCCCAGAGC AACGGGGTCA GCATAGCTAA CATTTTAAAT CAGCATAATG AATCCCTGTC 4 020 

ATATGGAGGA GGCAGAACTC CTCTTTGAAG TTGATATTTT AGATAAGACA GAGCCAGCCC 4080 

CTCTGGTTAT GGACAGTTCT TACCCAAAAT GAAACAGAGA AGAAAACCAC TGGTGTGTCA 4140 

CCTTTCCTTA GAAGTGCTTC AGGA 4164 

(2) INFORMATION FOR SEQ ID NO: 39 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANPEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Each N can represent any nucleotide 

and there can be 0 to 5 N 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 39 
TGAACTNNNN NTGAACT 17 



{2) INFORMATION FOR SEQ ID NO: 40 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPEi nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
TCTGASSAAG KTAAC 15 
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(2) INFORMATION FOR SEQ ID NO: 41 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CAATTAAAGA 10 



(2) INFORMATION FOR SEQ ID NO: 42 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 42 

CAATTAAAGA TGAACTTTGG GTGAACTAAT T 31 



(2) INFORMATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
GTAGCACGGA TGGTG 15 
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